Birds of a Feather Surf Together: Using Clustering Methods to Improve Navigation Prediction from Internet Log Files

نویسندگان

  • Martin Halvey
  • Mark T. Keane
  • Barry Smyth
چکیده

Many systems attempt to forecast user navigation in the Internet through the use of past behavior, preferences and environmental factors. Most of these models overlook the possibility that users may have many diverse sets of preferences. For example, the same person may search for information in different ways at night (when they are pursuing their hobbies and interests) as opposed to during the day (when they are at work). Thus, most users may well have different sets of preferences at different times of the day and behave differently in accordance with those preferences. In this paper, we present clustering methods for creating time dependent models to predict user navigation patterns; these methods allow us to segment log files into appropriate groups of navigation behaviour. The benefits of these methods over more established methods are highlighted. An empirical analysis is carried out on a sample of usage logs for Wireless Application Protocol (WAP) browsing as empirical support for the technique.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Web Navigation Path Pattern Prediction using First Order Markov Model and Depth first Evaluation

Web usage mining has been defined as a technique of finding hidden knowledge from a log file. The interaction between website and user is recorded in the related web server log file. Web designer is able to analyze the file in order to understand the interaction between users and a web site, which helps to improve web topology. All information of web usage can be generated from log files and it...

متن کامل

An Efficient Agglomerative Clustering Algorithm for Web Navigation Pattern Identification

Web log mining is analysis of web log files with web page sequences. Discovering user access patterns from web access are necessary for building adaptive web servers, to improve e-commerce, to carry out cross-marketing, for web personalization, to predict web access sequence etc. In this paper, a new agglomerative clustering technique is proposed to identify users with similar interest, and to ...

متن کامل

Dominance Rank Fuzzy Clustering and Distributed Probability Graph for Web User Behaviour Mining

Web usage mining examines the navigation patterns in web access logs and extracts the past unknown and valuable information to accessed web pages. This strategies helps for different web-oriented applications such as website framework called Dominance Fuzzy Clustering and Distributed Probability Graph (DFC-DPG) is designed. The main objective is in investigating the relation of cognitive styles...

متن کامل

Dominance Rank Fuzzy Clustering and Distributed Probability Graph for Web User Behaviour Mining

Web usage mining examines the navigation patterns in web access logs and extracts the past unknown and valuable information to accessed web pages. This strategies helps for different web-oriented applications such as website framework called Dominance Fuzzy Clustering and Distributed Probability Graph (DFC-DPG) is designed. The main objective is in investigating the relation of cognitive styles...

متن کامل

Prediction of Electrofacies Based on Flow Units Using NMR Data and SVM Method: a Case Study in Cheshmeh Khush Field, Southern Iran

The classification of well-log responses into separate flow units for generating local permeability models is often used to predict the spatial distribution of permeability in heterogeneous reservoirs. The present research can be divided into two parts; first, the nuclear magnetic resonance (NMR) log parameters are employed for developing a relationship between relaxation time and reservoir poro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005